Learning Extraction of Chinese Comparative Sentences for Evaluative Text
نویسندگان
چکیده
With the prevalence of Web 2.0, people increasingly prefer to express opinions and exchange information through CGM (consumer-generated media), such as blog, Internet forum and etc. Many studies pay attention to extract and analysis user opinions in consumer reviews. This paper studies how to automatically extract Chinese comparative sentences from consumer reviews. At first, the paper describes a method for solving the class imbalance problem of comparatives and non-comparatives in review data. Then we built a support vector machine learning model to classify comparatives and noncomparatives into different group on a balanced dataset. Experiments were conducted on consumer-generated product reviews, including 9600 sentences, of which 1,624 (16.92% of the total) were comparisons. Experiments show an overall F-score of 87.26%, which presents the effectiveness of the proposed approach.
منابع مشابه
Finding relevant features for Korean comparative sentence extraction
In this paper, we study how to extract comparative sentences from Korean text documents. We decompose our task into three steps: 1) collecting comparative keywords; 2) extracting comparative-sentence candidates by keyword searching; 3) eliminating non-comparative sentences from these candidates using machine learning techniques. We perform various experiments to find relevant features. As a res...
متن کاملEXTRACTION-BASED TEXT SUMMARIZATION USING FUZZY ANALYSIS
Due to the explosive growth of the world-wide web, automatictext summarization has become an essential tool for web users. In this paperwe present a novel approach for creating text summaries. Using fuzzy logicand word-net, our model extracts the most relevant sentences from an originaldocument. The approach utilizes fuzzy measures and inference on theextracted textual information from the docu...
متن کاملExtracting Comparative Entities and Predicates from Texts Using Comparative Type Classification
The automatic extraction of comparative information is an important text mining problem and an area of increasing interest. In this paper, we study how to build a Korean comparison mining system. Our work is composed of two consecutive tasks: 1) classifying comparative sentences into different types and 2) mining comparative entities and predicates. We perform various experiments to find releva...
متن کاملMining Comparative Sentences and Relations
This paper studies a text mining problem, comparative sentence mining. A comparative sentence expresses an ordering relation between two sets of entities with respect to some common features. For example, the comparative sentence “Canon’s optics are better than those of Sony and Nikon” expresses the comparative relation: (better, {optics}, {Canon}, {Sony, Nikon}). Given a set of evaluative text...
متن کاملA Rule Based Approach for Analysis of Comparative or Evaluative Questions in Tourism Domain
Comparative or evaluative questions are the non-factoid class of questions that contain comparative or evaluative keywords, which may or may not be directly quantifiable. This entails the need for extraction of comparative and evaluative features, identification of semantic meaning of those features and converting them to quantifiable criteria before data can be obtained from the source text. T...
متن کامل